add design doc for auto pass through hashagg #9295

guo-shaoge · 2024-08-07T07:41:52Z

What problem does this PR solve?

Issue Number: close #9296

Problem Summary:

What is changed and how it works?

Check List

Tests

Unit test
Integration test
Manual test (add detailed scripts or steps below)
No code

Side effects

Performance regression: Consumes more CPU
Performance regression: Consumes more Memory
Breaking backward compatibility

Documentation

Release note

None

Signed-off-by: guo-shaoge <[email protected]>

yibin87 · 2024-08-12T03:35:23Z

docs/design/2024-08-07-auto-pass-through-hashagg.md

+## Introduction
+The HashAgg pushed down to TiFlash can be a one-stage, two-stage, or three-stage. For two-stage and three-stage aggregations, the 1st hashagg is used for pre-aggregation to reduce the amount of data that needs to be shuffled.
+
+However, the optimizer cannot always choose the most suitable plan based on statistics. For example, it is common to encounter cases where two-stage HashAgg is used for datasets with high NDV, resulting in poor pre-aggregation effects in the 1st hashagg.


Should "1st hashagg" be "1st-stage hashagg"?
The preceding text uses "two-stage" and "three-stage".

yibin87 · 2024-08-12T03:44:53Z

docs/design/2024-08-07-auto-pass-through-hashagg.md

+### Core State Switch
+The 1st hashagg will follow the state transition diagram below, with each state having the following meanings:
+1. `Init`: The initial state, where it remains as long as the HashMap is smaller than a specific value(to make sure the HashMap can fit in the L2 cache). In this state, the incoming Block is inserted into the HashMap for pre-aggregation.
+2. `Adjust`: In this state, the Block is inserted into the HashMap while probing and recording the degree of aggregation for the Block. It will switch to `PreAgg` or `PassThrough`.


It will switch to `PreAgg`, `PassThrough`, or `Selective`?

Selective added
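
For readers following this thread, the transitions discussed above can be sketched as a small state function. This is a minimal illustration, not TiFlash's implementation: the size bound, the hit-ratio cut-offs, the row budgets, and the names `next_state`, `hit_ratio`, and `ROW_BUDGET` are all hypothetical. The quoted text does not say when `Selective` returns to `Adjust`, so the sketch assumes it behaves like `PreAgg` and `PassThrough` in that respect.

```python
from enum import Enum, auto


class State(Enum):
    INIT = auto()
    ADJUST = auto()
    PRE_AGG = auto()
    PASS_THROUGH = auto()
    SELECTIVE = auto()


# Hypothetical placeholders; the design doc defers the real values to the code.
INIT_MAX_MAP_BYTES = 1 << 20  # stand-in for "fits in the L2 cache" (assumed size)
HIGH_HIT_RATIO = 0.8          # assumed cut-off for "aggregates well"
LOW_HIT_RATIO = 0.2           # assumed cut-off for "aggregates poorly"
ROW_BUDGET = {
    State.PRE_AGG: 1000,       # "N" in the doc
    State.PASS_THROUGH: 5000,  # "M" in the doc
    State.SELECTIVE: 3000,     # assumed; the doc states no budget for this state
}


def next_state(state, map_bytes, hit_ratio, rows_in_state):
    """Return the state to use for the next Block.

    hit_ratio is an assumed measure of the "degree of aggregation":
    the fraction of probed rows whose key was already in the HashMap.
    """
    if state is State.INIT:
        # Stay in Init while the HashMap is still small.
        return State.INIT if map_bytes < INIT_MAX_MAP_BYTES else State.ADJUST
    if state is State.ADJUST:
        if hit_ratio >= HIGH_HIT_RATIO:
            return State.PRE_AGG
        if hit_ratio <= LOW_HIT_RATIO:
            return State.PASS_THROUGH
        return State.SELECTIVE
    # PreAgg, PassThrough, Selective: run for a row budget, then re-probe.
    return State.ADJUST if rows_in_state >= ROW_BUDGET[state] else state
```

With these placeholder values, `next_state(State.ADJUST, 1 << 20, 0.5, 0)` yields `State.SELECTIVE`, and a `PreAgg` phase falls back to `Adjust` once 1000 rows have been consumed.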

yibin87 · 2024-08-12T03:47:24Z

docs/design/2024-08-07-auto-pass-through-hashagg.md

+2. `Adjust`: In this state, the Block is inserted into the HashMap while probing and recording the degree of aggregation for the Block. It will switch to `PreAgg` or `PassThrough`.
+3. `PreAgg`: In this state, the Block is inserted into the HashMap. This state lasts for N rows(see code for N) before switching back to the `Adjust` state.
+4. `PassThrough`: In this state, the Block is directly put into the memory buffer. This state lasts for M rows(see code for M) before switching back to the `Adjust` state.
+5. `Selective`: For rows which can hit the HashMap, the aggregation function is calculated directly. For rows can't hit, they are put into the pass through buffer. So in this state, the HashMap does not grow.


Could you add an explanation of the change back to `Adjust`?
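
To make the `Selective` description concrete, here is a rough sketch of how one Block might be split in that state. It assumes a `SUM` aggregate over `(key, value)` rows and represents the pass-through rows as a list of row indices; the function name `process_selective` and this representation are illustrative assumptions, not the layout of `Block.info.selective` in TiFlash.

```python
def process_selective(groups, block):
    """Aggregate rows whose key is already in `groups`; report the rest.

    groups: dict mapping key -> running SUM (stands in for the HashMap).
    block:  list of (key, value) rows.
    Returns the indices of rows that missed the map and must be passed
    through unchanged. No new key is ever inserted into `groups`.
    """
    pass_through_rows = []
    for index, (key, value) in enumerate(block):
        if key in groups:
            groups[key] += value             # hit: aggregate in place
        else:
            pass_through_rows.append(index)  # miss: forward the raw row
    return pass_through_rows
```

For example, with `groups = {"a": 1}` and the block `[("a", 2), ("b", 3), ("a", 4)]`, the call returns `[1]` and leaves `groups` as `{"a": 7}`, so the map keeps its size while still absorbing the rows it can.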

zanmato1984 · 2024-08-12T05:39:27Z

docs/design/2024-08-07-auto-pass-through-hashagg.md

+* [Unresolved Questions](#unresolved-questions)
+
+## Introduction
+The HashAgg pushed down to TiFlash can be a one-stage, two-stage, or three-stage. For two-stage and three-stage aggregations, the 1st hashagg is used for pre-aggregation to reduce the amount of data that needs to be shuffled.


Suggested change

-The HashAgg pushed down to TiFlash can be a one-stage, two-stage, or three-stage. For two-stage and three-stage aggregations, the 1st hashagg is used for pre-aggregation to reduce the amount of data that needs to be shuffled.
+The HashAgg pushed down to TiFlash can be a plan of one-stage, two-stage, or three-stage. For two-stage and three-stage aggregations, the 1st hashagg is used for pre-aggregation to reduce the amount of data that needs to be shuffled.

I guess you are using "hashagg" (all lower-case) to differentiate it from the plan-level term "HashAgg" (camel-case). If I'm guessing right, you could just use "aggregation", because "hashagg" is not a valid word.

Done. All occurrences of "hashagg" are changed to "aggregation".

Signed-off-by: guo-shaoge <[email protected]>

yibin87

LGTM

zanmato1984

LGTM with two nits.

zanmato1984 · 2024-08-12T09:43:26Z

docs/design/2024-08-07-auto-pass-through-hashagg.md

+This functionality is accomplished through the `Block.info.selective` array. The DAGResponseWriter will use this array to determine which rows in a Block can be sent directly and which need to be ignored.
+
+### Spill
+If AutoPassThroughHashAgg is used, spilling will not occur. Once the HashMap grows large enough to require spilling, it will immediately trigger a forced pass-through(meaning all subsequent Blocks will be forced to pass through).


Suggested change

-If AutoPassThroughHashAgg is used, spilling will not occur. Once the HashMap grows large enough to require spilling, it will immediately trigger a forced pass-through(meaning all subsequent Blocks will be forced to pass through).
+If `AutoPassThroughHashAgg` is used, spilling will not occur. Once the HashMap grows large enough to require spilling, it will immediately trigger a forced pass-through(meaning all subsequent Blocks will be forced to pass through).
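
The forced pass-through rule in this paragraph can be illustrated with a short sketch. It is only an approximation under stated assumptions: the class `PreAggregator` and its fields are hypothetical, the aggregate is a plain `SUM`, and "large enough to require spilling" is modeled as an entry-count limit rather than a memory threshold.

```python
class PreAggregator:
    """Pre-aggregates SUM(value) by key until the map hits a size limit."""

    def __init__(self, max_map_entries):
        self.max_map_entries = max_map_entries  # assumed proxy for the spill threshold
        self.groups = {}                        # stands in for the HashMap
        self.passed_through = []                # rows forwarded without aggregation
        self.forced_pass_through = False

    def consume(self, block):
        """block: list of (key, value) rows."""
        if self.forced_pass_through:
            # After the trigger, every later Block bypasses the HashMap.
            self.passed_through.extend(block)
            return
        for key, value in block:
            self.groups[key] = self.groups.get(key, 0) + value
        if len(self.groups) >= self.max_map_entries:
            # Instead of spilling, switch to pass-through for good.
            self.forced_pass_through = True
```

Note that the switch is one-way in this sketch, matching the wording "all subsequent Blocks": once the limit is reached, even rows whose keys are already in the map are forwarded as-is.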

zanmato1984 · 2024-08-12T09:48:04Z

docs/design/2024-08-07-auto-pass-through-hashagg.md

+Additionally, the HashMap's Blocks will be prioritized for returning to the parent operator in order to quickly reduce memory pressure.
+
+## Impacts & Risks
+Since the algorithm judges the NDV of the overall dataset based on a small amount of data, it may lead to incorrect NDV estimation for some datasets, resulting in performance regression. In the future, this issue can be mitigated by introducing algorithms like LogLogCounting.


Suggested change

-Since the algorithm judges the NDV of the overall dataset based on a small amount of data, it may lead to incorrect NDV estimation for some datasets, resulting in performance regression. In the future, this issue can be mitigated by introducing algorithms like LogLogCounting.
+Since the algorithm estimates the NDV of the overall dataset based on a small amount of data, it may lead to incorrect NDV estimation for some datasets, resulting in performance regression. In the future, this issue can be mitigated by introducing algorithms like LogLogCounting.
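
The risk described here is easy to reproduce on a toy input. The sketch below is illustrative only: `distinct_ratio` and `sample_vs_full` are hypothetical helpers, and the exact set-based count is used for clarity, not because the design doc prescribes it.

```python
def distinct_ratio(keys):
    """Fraction of rows that carry a distinct key (1.0 means no duplicates)."""
    return len(set(keys)) / len(keys) if keys else 0.0


def sample_vs_full(keys, sample_rows):
    """Return (ratio on the leading sample, ratio on the whole input)."""
    return distinct_ratio(keys[:sample_rows]), distinct_ratio(keys)
```

If the input starts with a long run of one key followed by mostly unique keys, for instance `[0] * 1000 + list(range(1, 9001))`, a 1000-row leading sample suggests near-perfect aggregation while the full input is almost entirely distinct. A decision made from that sample alone would favor pre-aggregation at exactly the wrong time.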

ti-chi-bot · 2024-08-12T09:48:20Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: yibin87, zanmato1984

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [yibin87,zanmato1984]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

ti-chi-bot · 2024-08-12T09:48:23Z

[LGTM Timeline notifier]

Timeline:

2024-08-12 05:46:31.230093581 +0000 UTC m=+160475.933563223: ✖️🔁 reset by zanmato1984.
2024-08-12 09:26:53.953821083 +0000 UTC m=+173698.657290745: ☑️ agreed by yibin87.
2024-08-12 09:48:22.212167768 +0000 UTC m=+174986.915637412: ☑️ agreed by zanmato1984.

guo-shaoge added 2 commits August 7, 2024 15:37

doc: add design doc for auto pass through hashagg

5a4b96c

Signed-off-by: guo-shaoge <[email protected]>

png

d52bd3d

Signed-off-by: guo-shaoge <[email protected]>

ti-chi-bot bot added the labels do-not-merge/needs-linked-issue, release-note-none (denotes a PR that doesn't merit a release note), and size/M (denotes a PR that changes 30-99 lines, ignoring generated files), and removed the label do-not-merge/needs-linked-issue, Aug 7, 2024

guo-shaoge requested review from yibin87 and gengliqi August 7, 2024 07:47

guo-shaoge added 2 commits August 7, 2024 20:08

update image

54e4303

Signed-off-by: guo-shaoge <[email protected]>

update image

d55bb22

Signed-off-by: guo-shaoge <[email protected]>

yibin87 reviewed Aug 12, 2024

zanmato1984 requested changes Aug 12, 2024

refine

4540efb

Signed-off-by: guo-shaoge <[email protected]>

guo-shaoge requested review from zanmato1984 and yibin87 August 12, 2024 07:17

yibin87 approved these changes Aug 12, 2024

ti-chi-bot bot added the labels needs-1-more-lgtm (indicates a PR needs 1 more LGTM) and approved, Aug 12, 2024

zanmato1984 approved these changes Aug 12, 2024

ti-chi-bot bot added the label lgtm and removed the label needs-1-more-lgtm (indicates a PR needs 1 more LGTM), Aug 12, 2024

Merge branch 'master' into auto_pass_design

bd67dba

ti-chi-bot bot merged commit c485c37 into pingcap:master Aug 12, 2024
3 checks passed
